Extracting the Lowest-Frequency Words: Pitfalls and Possibilities
نویسندگان
چکیده
منابع مشابه
Extracting the Lowest Frequency Words: Pitfalls and Possibilities
s or to the complete newspaper corpus. This raises the question of whether better results might have been obtained if the complete data sets had been used. In principle, more data might imply more power. At the same time, more data also entails the risk of more noise. At least for our af data, enlarging the complement leads to worse performance. When we allow any sentence that contains af in ou...
متن کاملWords , the World , and Their Possibilities
By the principle of possibilities, we understand what an entity is with reference to what it could have been. The word red, for example, belongs to both a domain of lexical possibilities (all English words) and a domain of conceptual possibilities (all conceivable denotations). But on any occasion, the word is intended to be understood against much narrower domains. Speakers and addressees rest...
متن کاملCommentary: describing differences--possibilities and pitfalls.
Reports of attempts to investigate, characterize, compare, and contrast those who are mentally ill fill the literature and invite controversy. It seems to be part of human nature to reestablish and define the differences between us. Creative descriptive studies continually challenge our perspective, yet they must be balanced with thoughtful consideration of possible selection bias, an understan...
متن کاملTeaching Creative Interface Design: Possibilities and Pitfalls
Interface design is an essential aspect of any interactive system and thus a core component of most Human-Computer Interaction (HCI) curricula. Teaching creative interface design is, however, a challenging task, as it involves both an understanding of HCI theory and practice. A trade-off exists between enforcing the use of standard design aids such as guidelines and patterns, or encouraging the...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Linguistics
سال: 2000
ISSN: 0891-2017,1530-9312
DOI: 10.1162/089120100561719